Candidate sentence selection for language learning exercises: from a comprehensive framework to an empirical evaluation
Abstract
We present a framework and its implementation relying on Natural Language Processing methods, which aims at the identification of exercise item candidates from corpora. The hybrid system combining heuristics and machine learning methods includes a number of relevant selection criteria. We focus on two fundamental aspects: linguistic complexity and the dependence of the extracted sentences on their original context. Previous work on exercise generation addressed these two criteria only to a limited extent, and a refined overall candidate sentence selection framework appears also to be lacking. In addition to a detailed description of the system, we present the results of an empirical evaluation conducted with language teachers and learners which indicate the usefulness of the system for educational purposes. We have integrated our system into a freely available online learning platform.
Similar resources
Establishing an Argument-Based Validity Approach for a Low-Stake Test of Collocational Behavior
Most of the validation studies conducted across varying test application contexts are usually framed within the traditional conceptualization of validity and therefore lack a comprehensive framework to focus on test score interpretations and test score use. This study aimed at developing and validating a collocational behavior test (CBT), drawing on Kane's argument-based approach to validity. F...
Rule-based and machine learning approaches for second language sentence-level readability
We present approaches for the identification of sentences understandable by second language learners of Swedish, which can be used in automatically generated exercises based on corpora. In this work we merged methods and knowledge from machine learning-based readability research, from rule-based studies of Good Dictionary Examples and from second language learning syllabuses. The proposed selec...
Comparative Textbook Evaluation: Representation of Learning Objectives in Locally and Internationally Published ELT Textbooks
The present study evaluated the learning objectives represented in the recent Iranian nation-wide ELT textbooks, i.e. Prospect and Vision series, and compared them to those in the internationally-published textbook of Four Corners. To this end, Bloom’s revised taxonomy of learning objectives was utilized as the analytical framework to scrutinize the tasks and exercises of the textbooks using a ...
Design and Implementation of an Intelligent Part of Speech Generator
The aim of this paper is to report on an attempt to design and implement an intelligent system capable of generating the correct part of speech for a given sentence while the sentence is totally new to the system and not stored in any database available to the system. It follows the same steps a normal individual does to provide the correct parts of speech using a natural language processor. It...
The Influence of Data-Driven Exercises Through Using a Computer Program on Vocabulary Improvement in an EFL Context
The present study was conducted to evaluate data driven learning (DDL) combined with Computer Assisted Language Learning (CALL) as an approach to improving vocabulary knowledge of Iranian postgraduates majoring in teaching English, English literature and translation. The purpose was to help language learners get familiar with DDL as a student-centered method taking advantage of a computer progr...
Journal title: CoRR
Volume: abs/1706.03530
Publication date: 2017